CDS
Accession Number | TCMCG075C25919 |
gbkey | CDS |
Protein Id | XP_017983370.1 |
Location | join(4599208..4599301,4600209..4600322,4600403..4600711,4600796..4600852,4600948..4601004,4601102..4601242,4601333..4601473,4601563..4601670,4601793..4601870,4601959..4602042,4602204..4602332,4602423..4602638,4602733..4602804,4602913..4603005,4603103..4603210,4603329..4603700,4603788..4604084,4604369..4604758,4604869..4605112,4605678..4606368) |
Gene | LOC18588393 |
GeneID | 18588393 |
Organism | Theobroma cacao |
Protein
Length | 1264aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018127881.1 |
Definition | PREDICTED: receptor-like serine/threonine-protein kinase ALE2 isoform X3 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGCTTTATGGGATGGGGATGTTGATGCCACTGATTCTCCAGCTTGCCAGACTCTGCATCATTGGCTTTCCTTTAATTGTTCAGGAATCTTCAGGGCACAATGCATCCCCGTCTCCAGCAAAGTTTTTTATGTTTCCTCCTGCAGAAGGAATCCCTAGTGCTGTTGAACAAAGAAGCGATGCACCAAACACACTCTCTCAGCCAAATGATTTGCATTCTCCTCCAGCATTACCACCTCTTATGTCTGCCTCAGTGCCTGAAACGACTGAAGGGCATGCACGTTCCTTTTCCCCGAGTAATTCAATGGAATTGCCACCATATAATACAGCTCCGCCTCCAGTTACCGTTGAAGAAGGTGTACCATCCTTGGCACCAAGTACTCCTGCGGTGTTGCCACCATTTGATACAGCTCCTCCACCTATGCTTGTTCAAGTACACACACCATCAAAGTCACCAACTGCTCTGCAGAAAAAGGAACCAATTATGAAGTCTCCTCCCTCAGTGCCAGATGCTCCAGCTCCAGTTGCATCACCCTCAAGGAATTTGCCTCAGAATTCACCAGCTATCCACCCATTTCCATCAATAACACCAACACAGAATTCTCCAGAAAATTCACCAGTTGTCCATCAAACTCCATTTGCACCGCCATTGAGGAATCCAGCACAAAATTCCCCACCCACTCAATCCAGTAGGCGAAGTGCTTTCCCCCCAATTTCTAATCAAAGAAATTCATCAAGTAATAGGGCACCTGTTTTGGAGCCAACTGCTCCAGCTCCAGTTGCACCACCATTGAGGAATCCACCACAAAATTCCCCAGCCATTCACTTCAGTAAGCCACATGCCTTGCCCCCAAGTGCCAATCAAGGAAATTCATCAAATAAGACAGCACCTGTTATGGAGCCAATTGCTCCAGTTCCAGTCGCAACACCTTCGGGAAATTCACCACGAAATCAAACAGCCATTCACCCAAGGGGGCCTGCTTTAGCACCAAGTGTTCCTATTCCAGAGCCCACTGCTCCAGCAGTTGCATCTCCCCCAAGGAAATTAGAAAGGACCACACCACCTGTCCACTCAATCATGCCACCGTCCATTTTACCAGTTGTATCACCACCAGAAGAATCACCCCACATTTCACCAACCATCCACCCAAATGTGCCAGAAGGAACTCCATCTCAGTTACCAGATCCTGATATCTCTCCTGTTTCAACTCCACCCTCGAGCATTAACTGGAAAAATGATGGAATACCAGTTGCGTCACCTAGGAATGAGATACACAAGCCAATGCCACCTTTGAGCCATACCCCAGAAAATGGTTCTTCCTCAGCCAAATCTCCTTTGGCACCCAAAGCTGTGAGGCACCCTGGTAATTCTCCTGTTTCATCTCTTGCTCCTTCAAATAAAGGGTATAACCCTCCTGCATTGTCACCTTCAATCTCGTTTCATAAGCATCAACATAAAAGAAATGGAAGAACCAGTCCTGCTCCTGCATCATCATACCTGATTTCACCTCCTCCTCTGAAACAGCAAGGTCCAGTTATCTCTCCAGCATTTCTTCCTGGAAGAAGGCGAAGACACTATGCTCCTGCACCTCTTCATTCTGTTTCCCCATCCCATTCTGCTGTTCCTTCGTCAGCAGGCACTGTTTCTCCTGTTCCTTCACCATCTCCAATGACTGCATCTAGGCAGACCAAAATGCCACTTAGCCCCCCAAAAGTTTCTCCTTCTGTGTCGCCATTGAGGAGTCCCAAGGTGCCACCTCCGCCACCAGTTATGTCATTTCCACCTCCACCTCCTAATGAAGATTGTTCAACAACTATTTGCACAGAACCTTATACAAATACACCTCCTGGATCTCCTTGTGGCTGTGTCTTGCCAATGCAAGTTGGATTACGCCTGAGTGTTGCTCTCTATACTTTCTTCCCTTTGGTTTCTGAGCTGGCAACAGAAATTGCTGCTGGAGTTTTTATGAGACAAAGTCAAGTTCGCATTATTGGAGCTAATGCTGCTAGTGAGCAACCCGAGAAGACAGTTGTCCTTATAGACTTGGTACCGCTTGGTGAAAAATTTGATAACACCACAGCCTTTTTAACTTATCAGAGATTTTGGCATAAACAAGTTGCTATAAAAACTTCATTTTTTGGGGATTATGAAGTATTATATGTGCACTATCTTGGTTTGCCTCCTTCTCCACCTTTGCCCCCTTCCGACATTGATATAATGGATGCTGGACCATATTCTGGTAATGACAACAATGCGAGGGCTATAAAGCCCCTTGGTGTTGATGTGCATGGAAAGCGGCATAAAAATGTGCTTAGTGGCGGCGTGATTGCTATAATTGTTCTGTCTGCTTTGGTGGCTATGGTGTTATGCTCTGCCATTGCATGGGTTTTGCTTTTCAGACGTACAAATCATGCTAGTCAACAAGCAGCAACTACACAGCCTCCGCAAACATCTCTTGCCAAACCATCAGGTTCTGCTGGGTCAATGGTTGGAAGCGGTCTAAGTTCCACATCACTGTCATTTGGCTCTAGCATTGTAGCTTATACAGGATCTGCAAAGACCTTCAGTACAAGTGATATAGAAAAAGCCACTAACAATTTTGATGCTTCAAGAATACTTGGGGAAGGTGGATTTGGTCGTGTTTATAGTGGTGTTCTTGAAGATGGAACTAAAGTGGCAGTCAAAGTTCTGAAGAGAGATGATCAGCAAGGTGGCAGGGAATTCTTGGCTGAGGTGGAGATGCTTAGCCGTCTTCACCACAGAAACTTGGTGAAGTTGATTGGTATATGCACAGAGGAGCGCAACCGCTGCTTGGTTTATGAACTCATTCCAAATGGCAGTGTTGAATCTCACTTGCATGGAGTTGACAAGGATTCTGCACCACTTGACTGGGATGCCCGGATAAAGATAGCCCTTGGTGCTGCTCGTGGGCTGGCTTATCTCCATGAAGATTCAAGCCCACGTGTCATCCACCGGGATTTTAAGTCAAGCAACATCTTGTTGGAGCATGATTTCACACCAAAAGTGTCTGACTTTGGTTTGGCTCGAACTGCCATGGACGAGGAAGGCAGGCACATATCAACACGTGTCATGGGAACTTTTGGGTATGTGGCTCCTGAGTATGCAATGACTGGCCATCTACTTGTGAAGAGTGATGTTTACAGCTATGGTGTAGTCCTTCTTGAGCTTCTGACGGGGAGAAAACCAGTAGATATGACACAGCCACCAGGTCAGGAGAATCTAGTTGCATGGGCTCGTCCTCTTCTCACAACCAAAGAAGGGTTAGAAACAATTATAGATCCATCGCTAAGTTCTGATGTTCCTTTTGATAGTGTGGCCAAAGTAGCAGCAATAGCTTCAATGTGTGTTCAACCTGAGGTATCACACCGACCTTTTATGGGTGAGGTTGTTCAGGCTTTAAAACTAGTAAGTAACGAATGCGATGAGGCAAAAGAAGTAGGTTCAAGATGTTCTAGTCAGGATGATTTGTCCATTGAACTGGATGCTAGGGTTAGTACTGGCTCAGGACAATTGGCTGATCCCTTGCAAAGCCATTATTTGATTCCTAACTATGACACTGGTCTTGATACTGAGAGAGGACTATCGGTGTCAGATTTGTTTAGTTCATCAGCAAGATTTGGAAGGCAATCATCTGGGTCATTTCGGAGGCACTGTAGCTCAGGTCCCCTGAGAACAGCAACGGGCAGTCGTTTCTGGCAAAAAGTGCAAAGATTGTCTAGGGGCAGCATCAGTGAACATGGTGTTATGATGAGGTTCTGGCCAGGGTCACATTGA |
Protein: MLYGMGMLMPLILQLARLCIIGFPLIVQESSGHNASPSPAKFFMFPPAEGIPSAVEQRSDAPNTLSQPNDLHSPPALPPLMSASVPETTEGHARSFSPSNSMELPPYNTAPPPVTVEEGVPSLAPSTPAVLPPFDTAPPPMLVQVHTPSKSPTALQKKEPIMKSPPSVPDAPAPVASPSRNLPQNSPAIHPFPSITPTQNSPENSPVVHQTPFAPPLRNPAQNSPPTQSSRRSAFPPISNQRNSSSNRAPVLEPTAPAPVAPPLRNPPQNSPAIHFSKPHALPPSANQGNSSNKTAPVMEPIAPVPVATPSGNSPRNQTAIHPRGPALAPSVPIPEPTAPAVASPPRKLERTTPPVHSIMPPSILPVVSPPEESPHISPTIHPNVPEGTPSQLPDPDISPVSTPPSSINWKNDGIPVASPRNEIHKPMPPLSHTPENGSSSAKSPLAPKAVRHPGNSPVSSLAPSNKGYNPPALSPSISFHKHQHKRNGRTSPAPASSYLISPPPLKQQGPVISPAFLPGRRRRHYAPAPLHSVSPSHSAVPSSAGTVSPVPSPSPMTASRQTKMPLSPPKVSPSVSPLRSPKVPPPPPVMSFPPPPPNEDCSTTICTEPYTNTPPGSPCGCVLPMQVGLRLSVALYTFFPLVSELATEIAAGVFMRQSQVRIIGANAASEQPEKTVVLIDLVPLGEKFDNTTAFLTYQRFWHKQVAIKTSFFGDYEVLYVHYLGLPPSPPLPPSDIDIMDAGPYSGNDNNARAIKPLGVDVHGKRHKNVLSGGVIAIIVLSALVAMVLCSAIAWVLLFRRTNHASQQAATTQPPQTSLAKPSGSAGSMVGSGLSSTSLSFGSSIVAYTGSAKTFSTSDIEKATNNFDASRILGEGGFGRVYSGVLEDGTKVAVKVLKRDDQQGGREFLAEVEMLSRLHHRNLVKLIGICTEERNRCLVYELIPNGSVESHLHGVDKDSAPLDWDARIKIALGAARGLAYLHEDSSPRVIHRDFKSSNILLEHDFTPKVSDFGLARTAMDEEGRHISTRVMGTFGYVAPEYAMTGHLLVKSDVYSYGVVLLELLTGRKPVDMTQPPGQENLVAWARPLLTTKEGLETIIDPSLSSDVPFDSVAKVAAIASMCVQPEVSHRPFMGEVVQALKLVSNECDEAKEVGSRCSSQDDLSIELDARVSTGSGQLADPLQSHYLIPNYDTGLDTERGLSVSDLFSSSARFGRQSSGSFRRHCSSGPLRTATGSRFWQKVQRLSRGSISEHGVMMRFWPGSH |